Performance Analysis of MEC Approach for Haplotype Assembly
Abstract
The Minimum Error Correction (MEC) approach is used as a metric for reconstructing haplotypes from next-generation sequencing (NGS) reads. In this paper, we show that the MEC may yield imprecise reconstructed haplotypes for some NGS devices. Specifically, using mathematical derivations, we evaluate this approach for the SOLiD, Illumina, 454, Ion, Pacific Biosciences, Oxford Nanopore, and 10X Genomics devices. Our results reveal that the MEC yields inexact haplotypes for the Illumina MiniSeq, 454 GS Junior+, Ion PGM 314, and Oxford Nanopore MK 1 MinION.
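As a rough illustration of the metric the abstract refers to, the MEC score of a candidate haplotype pair is commonly defined as the smallest number of read entries that would have to be corrected so that every read agrees with one of the two haplotypes. The sketch below is not taken from the paper: the function name `mec_score`, the 0/1 allele encoding, and the use of `-` for positions a read does not cover are all assumptions made for the example.

```python
def mec_score(reads, hap1, hap2):
    """Return the MEC score of a haplotype pair for a set of reads.

    Assumed encoding (hypothetical, for illustration only):
      - each read is a string over {'0', '1', '-'}, aligned to the SNP sites;
        '-' marks a site the read does not cover
      - hap1 and hap2 are strings over {'0', '1'} of the same length
    """
    def mismatches(read, hap):
        # Count covered sites where the read disagrees with the haplotype.
        return sum(1 for r, h in zip(read, hap) if r != '-' and r != h)

    # Each read is assigned to whichever haplotype it disagrees with least;
    # the score is the total number of disagreements left after assignment.
    return sum(min(mismatches(r, hap1), mismatches(r, hap2)) for r in reads)


reads = ["01-1", "10-0", "0-01", "110-"]
print(mec_score(reads, "0101", "1010"))
```

A lower score means fewer corrections are needed; the abstract's claim is that, for some devices, the pair minimizing this score need not be the true haplotype pair.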
Similar resources
Self-organizing map approaches for the haplotype assembly problem
Haplotype assembly aims to reconstruct a pair of haplotypes from SNP values observed in a set of individual DNA fragments. In this paper, we focus on studying the minimum error correction (MEC) model for the haplotype assembly problem and explore self-organizing map (SOM) methods for this problem. Specifically, haplotype assembly by MEC is formulated as an integer linear programming model. Since th...
High-Performance Haplotype Assembly
The problem of Haplotype Assembly is an essential step in human genome analysis. It is typically formalised as the Minimum Error Correction (MEC) problem, which is NP-hard. MEC has been approached using heuristics, integer linear programming, and fixed-parameter tractability (FPT), including approaches whose runtime is exponential in the length of the DNA fragments obtained by the sequencing proc...
Haplotype reconstruction from SNP fragments by minimum error correction
Motivation: Haplotype reconstruction based on aligned single nucleotide polymorphism (SNP) fragments aims to infer a pair of haplotypes from localized polymorphism data gathered through short genome fragment assembly. An important computational model of this problem is the minimum error correction (MEC) model, which has been discussed in several studies. The model retrieves a pair of haplotype...
Solving Haplotype Assembly Problem Using Harmony Search
Single Nucleotide Polymorphisms (SNPs), single DNA bases that vary from one individual to another, are believed to be the most frequent form responsible for genetic differences. Haplotypes carry more information for disease association than individual SNPs or genotypes; however, it is substantially more difficult to determine haplotypes through experiments. Hence, computational methods that can reduce the...
Towards High-performance Haplotype Assembly for Future Sequencing
The problem of Haplotype Assembly is an essential step in human genome analysis. Because the well-known MEC model for its solution is NP-hard, it is currently addressed using algorithms whose cost grows exponentially with the length of the DNA fragments obtained by the sequencing process. Technological improvements will reduce fragmentation, increase fragment length and make such computational costs worse. ...